A Matrix–Matrix Multiplication methodology for single/multi-core architectures using SIMD

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chemical Kinetics on Multi-core SIMD Architectures

Chemical kinetics modeling accounts for a significant portion of the computational time of atmospheric models. Effective application of multiple levels of heterogeneous parallelism can significantly reduce computational time, but implementation on emerging multi-core technologies can be prohibitively difficult. We introduce an approach for chemical kinetics modeling on multi-core SIMD architect...

متن کامل

Fast recursive matrix multiplication for multi-core architectures

In this article, we present a fast algorithm for matrix multiplication optimized for recent multicore architectures. The implementation exploits different methodologies from parallel programming, like recursive decomposition, efficient low-level implementations of basic blocks, software prefetching, and task scheduling resulting in a multilevel algorithm with adaptive features. Measurements on ...

متن کامل

Models for Simd, Mimd and Hybrid Parallel Architectures Models for Simd, Mimd and Hybrid Parallel Architectures

Realizzazione di modelli di varie architetture di calcolatori paralleli (di tipo SIMD, MIMD e ibridi) e loro validazione sperimentale To Floriana Acknowledgments I would like to thank my supervisor Prof. Giuseppe Serazzi for the inspiration , guidance, friendship ooered throughout my studies. I feel a special gratitude towards Peter King and Paolo Cremonesi for their important contribution to t...

متن کامل

Programming and compiling for embedded SIMD architectures

Declaration This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except where specifically indicated in the text. I confirm that this dissertation, including tables and footnotes, but excluding appendices, bibliography and diagrams, does not exceed the regulation length of 60 000 words. Summary This dissertation studies programmi...

متن کامل

Efficient fuzzy compiler for SIMD architectures

This paper presents a real-time full-programmable fuzzy compiler based on piecewise linear interpolation techniques designed to be executed in SIMD (Single Instruction Multiple Data) architectures. A fullprogrammable fuzzy processor is defined as a system where the set of rules, the membership functions, the t-norm, the t-conorm, the aggregation operator, the propagation operator, and the defuz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Journal of Supercomputing

سال: 2014

ISSN: 0920-8542,1573-0484

DOI: 10.1007/s11227-014-1098-9